PS-ARM: An End-to-End Attention-Aware Relation Mixer Network for Person Search

نویسندگان

چکیده

Person search is a challenging problem with various real-world applications, that aims at joint person detection and re-identification of query from uncropped gallery images. Although, previous study focuses on rich feature information learning, it’s still hard to retrieve the due occurrence appearance deformations background distractors. In this paper, we propose novel attention-aware relation mixer (ARM) module for search, which exploits global between different local regions within RoI make it robust against occlusion. The proposed ARM composed block spatio-channel attention layer. introduces spatially attended spatial mixing channel-wise channel effectively capturing discriminative features an RoI. These are further enriched by introducing where foreground discriminability empowered in space. Our generic does not rely fine-grained supervisions or topological assumptions, hence being easily integrated into any Faster R-CNN based methods. Comprehensive experiments performed two benchmark datasets: CUHK-SYSU PRW. PS-ARM achieves state-of-the-art performance both datasets. On PRW dataset, our absolute gain 5% mAP score over SeqNet, while operating comparable speed. source code pre-trained models available https://github.com/mustansarfiaz/PS-ARM .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

End-to-End Deep Learning for Person Search

Existing person re-identification (re-id) benchmarks and algorithms mainly focus on matching cropped pedestrian images between queries and candidates. However, it is different from real-world scenarios where the annotations of pedestrian bounding boxes are unavailable and the target person needs to be found from whole images. To close the gap, we investigate how to localize and match query pers...

متن کامل

Attention-Based End-to-End Speech Recognition on Voice Search

Recently, there has been an increasing interest in end-to-end speech recognition that directly transcribes speech to text without any predefined alignments. In this paper, we explore the use of attention-based encoder-decoder model for Mandarin speech recognition and to the best of our knowledge, achieve the first promising result. We reduce the source sequence length by skipping frames and reg...

متن کامل

JEJUNAL EVERSION MUCOSECTOMY AND INVAGINATION: AN INNOVATIVE TECHNIQUE FOR THE END TO END PANCREATICOJEJUNOSTOMY

 ABSTRACT Background: The pancreatojejunostomy has notoriously been known to carry a high rate of operative complications, morbidity and mortality, mainly due to anastomotic leak and ensuing septic complications. Objective: In order to decrease anastomotic leak and its attendant morbidity and mortality in operations requiring a pancreato-jejunal anastomosis, and also in order to simplify the op...

متن کامل

End-to-End Detection and Re-identification Integrated Net for Person Search

This paper proposes a pedestrian detection and reidentification (re-id) integration net (I-Net) in an end-to-end learning framework. The I-Net is used in real-world video surveillance scenarios, where the target person needs to be searched in the whole scene videos, while the annotations of pedestrian bounding boxes are unavailable. By comparing to the successful CVPR’17 work [Xiao et al., 2017...

متن کامل

Comprehensive end-to-end test for intensity-modulated radiation therapy for nasopharyngeal carcinoma using an anthropomorphic phantom and EBT3 film

Background: In head and neck radiotherapy, immobilization devices can affect dose delivery. In this study, a comprehensive end-to-end test was developed to evaluate the accuracy of radiotherapy treatment. Materials and Methods: An Alderson Radiation Therapy (ART) anthropomorphic phantom with EBT3 film was used to mimic the actual patient treatment process. Ten patients treated for nasopharyngea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-26348-4_14